ABSTRACT

This paper presents a comparison between parameter optimal and PID type iterative learning control applied to a robot
arm. Both control schemes are based on the principle of iterative learning control, in which data from past trials
are used to improve the system performance. Both also use optimally chosen gains, so that the tracking error
converges in norm to zero. The control design is simple in the sense that the only requirement is to track the
demanded reference with minimum error. In this framework, the desired trajectory is specified in terms of the joint
angle. The results show that the PID type iterative learning control gives better performance than the parameter
optimal scheme because it converges faster. The numerical experiments also suggest that the PID type scheme is capable
of tracking the desired trajectory precisely.
1. INTRODUCTION

Iterative learning control (ILC) is a technique for controlling systems that perform the same task repeatedly
and periodically. It addresses the trajectory tracking problem in which the required trajectory is repeated
over a finite duration known as the trial or iteration length. It applies mostly to robot manipulators that are
required to repeat a given task with high precision. Other examples are chemical batch processes, automated
manufacturing plants or, more generally, the class of tracking systems. The general objective of iterative learning
control is to use the repetitive nature of the process to gradually improve the precision. Amann et al. (1996) state
that the basic idea of the algorithm is to suitably use information from previous trials, often in combination with
appropriate current trial information, to select the current trial input so as to sequentially improve performance
from trial to trial in the sense that the tracking error is gradually reduced. More specifically, during the execution
of the algorithm at the kth iteration, data such as the control input and the error are recorded. This information is
used in the learning process at the (k+1)th iteration to refine the control input and progressively reduce the error.
After a number of iterations, an appropriate control input should be obtained, and this input finally generates the
desired output.

Since the introduction of the iterative learning control methodology by Arimoto et al. (1984), the general
area of ILC has been progressively advanced by several researchers in terms of both underlying theory and application.
Examples of the theoretical aspect are found in Owens and Feng (2003), Owens, Chu, and Songjun (2012), and
Madady (2008). Owens and Feng present a parameter optimization through a quadratic performance index
as a method to establish a new control law. This algorithm guarantees the monotonic convergence
of the error to zero if the original system is a discrete-time LTI system. In Owens, Chu, and Songjun, a
multi-parameter optimal ILC algorithm that uses an approximate polynomial representation of the inverse plant is
introduced. This algorithm is capable of improving the convergence rate as the number of parameters
increases. Madady proposes a proportional plus integral and derivative (PID) type ILC. This technique uses the
principle of optimal design to determine the PID coefficients. It can guarantee monotonic convergence under some
given conditions. Examples of the application aspect are reviewed in Tayebi (2004), Lee and Lee (2007), and Sun,
Hou, and Li (2013), which apply the iterative learning control method to a robot manipulator, a batch process
system and a train trajectory tracking system, respectively.

In this paper, two iterative learning control schemes for the trajectory tracking problem of rigid robot
manipulators are presented and compared. One is parameter optimal iterative learning control (POILC) using polynomial
representations of the inverse plant. The other is PID type iterative learning control. The convergence of the tracking
error to zero over the finite time interval is guaranteed as the iteration number tends to infinity. The paper is
organized as follows. Section 2 explains the POILC using polynomial representations of the inverse plant in detail.
Section 3 describes the PID type iterative learning control. Both are then applied to the robot manipulator,
and illustrative simulation results are given in Section 4. Finally, some conclusions are drawn in Section 5.

2. POILC USING POLYNOMIAL REPRESENTATIONS OF INVERSE PLANT 

The starting point of POILC using polynomial representations of the inverse plant is to consider the system in
discrete-time state-space form, in which the states, inputs and outputs are assumed to be sampled at intervals h over a
time interval [0, N] and the number of samples T = N/h is known. The process model is written in the state-space form

$$
\begin{aligned}
x(t+1) &= A x(t) + B u(t), \qquad x(0) = x_0, \\
y(t) &= C x(t) + D u(t)
\end{aligned}
\tag{1}
$$


where the matrices A, B, and C have appropriate dimensions and D is a scalar. It is useful in the analysis to
replace the plant model (1) by a matrix model relating a time series vector of inputs to a time series vector of outputs
for each trial. To construct this matrix model, define the time series input and the time series output on the kth
trial via

$$
\begin{aligned}
u_k &= [u_k(0), u_k(1), \ldots, u_k(N-m)]^T, \\
y_k &= [y_k(m), y_k(m+1), \ldots, y_k(N)]^T, \\
r &= [r(m), r(m+1), \ldots, r(N)]^T
\end{aligned}
\tag{2}
$$

With the above definitions, the usual formula for the input-output response of the system can be written in the matrix
form

$$y_k = G u_k + d_0 \tag{3}$$


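where G is the matrix of Markov parameters of the plant (1) and has a lower triangular band structure.
For simplicity, it is assumed that m = 1. The dimension of G is therefore N × N, and G can be written as

$$
G = \begin{bmatrix}
CB & 0 & \cdots & 0 \\
CAB & CB & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
CA^{N-1}B & CA^{N-2}B & \cdots & CB
\end{bmatrix}
\tag{4}
$$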
Furthermore, the tracking error at iteration k is defined as $e_k = r - G u_k - d_0 = (r - d_0) - G u_k$ and hence,
without loss of generality, it is possible to replace r by $r - d_0$ and therefore to assume that $d_0 = 0$ in what
follows. Equivalently, it is possible to assume that $x_0 = 0$.
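
As an illustration of the matrix model (3), the following minimal Python sketch builds G from the Markov parameters
of a state-space triple and checks it against a direct simulation of (1) with zero initial state. The plant matrices,
the horizon N = 20 and the function names are assumptions made only for this example; D = 0 and m = 1 are also assumed.

```python
import numpy as np

def lifted_plant_matrix(A, B, C, N):
    """Return the N x N lower-triangular Toeplitz matrix whose (i, j) entry is
    C A^(i-j) B for i >= j (assumes relative degree m = 1 and D = 0)."""
    markov = np.empty(N)
    Ai_B = B.copy()                      # holds A^i B, starting with i = 0
    for i in range(N):
        markov[i] = (C @ Ai_B).item()
        Ai_B = A @ Ai_B
    G = np.zeros((N, N))
    for i in range(N):
        for j in range(i + 1):
            G[i, j] = markov[i - j]
    return G

def simulate(A, B, C, u, x0):
    """Run x(t+1) = A x(t) + B u(t), y(t) = C x(t) and return y(1), ..., y(N)."""
    x = x0.copy()
    y = []
    for ut in u:
        x = A @ x + B * ut
        y.append((C @ x).item())
    return np.array(y)

# Hypothetical second-order plant, used only for illustration.
A = np.array([[0.9, 0.1], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.5]])
N = 20

G = lifted_plant_matrix(A, B, C, N)
u = np.sin(0.3 * np.arange(N))                     # u(0), ..., u(N-1)
y_sim = simulate(A, B, C, u, np.zeros((2, 1)))     # x0 = 0, so d_0 = 0
y_lifted = G @ u
print(np.max(np.abs(y_sim - y_lifted)))
```

With a nonzero initial state the two outputs would differ by the free response, which is the term absorbed into the
reference in the argument above.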

The repetitive nature of the problem permits the iterative modification of the input function u(t) so that, as the
number of iterations increases, the system learns the input function that gives perfect tracking. More precisely, the
control objective is to find a recursive algorithm, denoted by f in the formula

$$u_{k+1} = f(u_k, u_{k-1}, \ldots, u_{k-r}, e_{k+1}, \ldots, e_{k-s}) \tag{5}$$

with the properties that, independent of the control input $u_0$ chosen for the first trial,

$$\lim_{k \to \infty} \|e_k\| = 0 \quad \text{and} \quad \lim_{k \to \infty} \|u_k - u^*\| = 0 \tag{6}$$

where $\|\cdot\|$ denotes any norm for the time series.

Motivated by the Cayley-Hamilton theorem, the following approximation to the inverse ILC algorithm is
presented

$$u_{k+1} = u_k + \sum_{j=1}^{M} \beta_{k+1}(j)\, G^{j-1} e_k, \qquad k \ge 0 \tag{7}$$


Based on a natural multi-parameter generalization of the POILC algorithm, this algorithm specifies the parameter
vector $\beta_{k+1}$ by solving the optimization problem

$$\beta_{k+1} = \arg\min_{\beta_{k+1}} \{ J(\beta_{k+1}) : e_{k+1} = r - G u_{k+1} - d_0 \} \tag{8}$$

where the objective function is

$$J(\beta_{k+1}) = \|e_{k+1}\|^2 + \beta_{k+1}^T W_{k+1} \beta_{k+1} \tag{9}$$

and $W_{k+1} = W_{k+1}^T > 0$ is a weighting matrix introduced to ensure good conditioning of the algorithm.

Using $e_k = r - G u_k$, $k \ge 0$, the error evolution equation becomes

$$e_{k+1} = e_k - \sum_{j=1}^{M} \beta_{k+1}(j)\, G^{j} e_k = e_k - M(e_k)\, \beta_{k+1} \tag{10}$$

The necessary and sufficient condition for optimality is that $\partial J / \partial \beta_{k+1} = 0$, which gives the
compact formula

$$\beta_{k+1} = \left( W_{k+1} + M^T(e_k) M(e_k) \right)^{-1} M^T(e_k)\, e_k \tag{11}$$

where

$$M(e_k) = [G e_k, G^2 e_k, \ldots, G^M e_k] \tag{12}$$

This algorithm has the desirable property that the error norm decreases monotonically. It also ensures the boundedness
of signals and a sequential improvement in the mean square tracking error from iteration to iteration.
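
One trial of this parameter-optimal update can be sketched numerically as follows. The sketch assumes that the input
correction is a weighted sum of the vectors e_k, G e_k, ..., G^(M-1) e_k, that the weight is constant with W = wI, and
that the lifted plant G is a hypothetical lower-triangular Toeplitz matrix; none of these values are taken from the
simulation study, and the function name is illustrative.

```python
import numpy as np

def poilc_step(G, u, e, M, w):
    """Return (u_next, e_next, beta) for one parameter-optimal ILC trial.

    Assumes W = w * I and an input correction sum_j beta_j G^(j-1) e.
    """
    cols = []
    v = e.copy()
    for _ in range(M):
        v = G @ v
        cols.append(v)
    Me = np.column_stack(cols)                       # [G e, G^2 e, ..., G^M e]
    # Minimiser of ||e - Me beta||^2 + w ||beta||^2.
    beta = np.linalg.solve(w * np.eye(M) + Me.T @ Me, Me.T @ e)
    du = beta[0] * e
    for j in range(1, M):
        du = du + beta[j] * cols[j - 1]              # beta_(j+1) * G^j e
    return u + du, e - Me @ beta, beta

# Hypothetical lifted plant with impulse response 0.5 * 0.8**i.
N = 30
h = 0.5 * 0.8 ** np.arange(N)
G = np.array([[h[i - j] if i >= j else 0.0 for j in range(N)] for i in range(N)])
r = np.sin(0.2 * np.arange(1, N + 1))                # illustrative reference

u = np.zeros(N)
e = r - G @ u
norms = [np.linalg.norm(e)]
max_gap = 0.0
for _ in range(10):
    u, e_pred, beta = poilc_step(G, u, e, M=3, w=1e-6)
    e = r - G @ u                                    # error measured on the new trial
    max_gap = max(max_gap, np.linalg.norm(e - e_pred))
    norms.append(np.linalg.norm(e))
print(norms)
```

Because beta = 0 is always admissible in the minimisation, the optimal cost cannot exceed the previous squared error
norm, which is why the recorded norms do not increase from trial to trial.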

3. PID TYPE ITERATIVE LEARNING CONTROL 

The starting point of the PID type iterative learning control is to consider the following updating law to determine
the input of the system (1)

$$u_{k+1}(t) = u_k(t) + \Delta u_{k+1}(t) \tag{13}$$

where $\Delta u_{k+1}(t)$ is a correction term formed using a PID scheme. Hence $\Delta u_{k+1}(t)$ is chosen as follows

$$\Delta u_{k+1}(t) = k_P e_k(t+1) + k_I \sum_{m=1}^{t+1} e_k(m) + k_D \left( e_k(t+1) - e_k(t) \right) \tag{14}$$

Therefore, the relations (13) and (14) give

$$u_{k+1}(t) = u_k(t) + k_P e_k(t+1) + k_I \sum_{m=1}^{t+1} e_k(m) + k_D \left( e_k(t+1) - e_k(t) \right) \tag{15}$$

where the error is defined as

$$e_k(t) = y_d(t) - y_k(t) \quad \text{for } 1 \le t \le N \tag{16}$$

and $k_P$, $k_I$, and $k_D$ are real constant coefficients called the proportional, integral and derivative learning
gains, respectively.
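
A sample-by-sample reading of the PID type update law can be sketched as below. The sketch assumes that the integral
term sums the error from m = 1 up to t + 1 and that e_k(0) = 0 (zero initial tracking error); the function name and the
numerical values are illustrative only.

```python
import numpy as np

def pid_ilc_update(u, e, kP, kI, kD):
    """Apply the PID type ILC law once.

    u holds u_k(0), ..., u_k(N-1); e holds e_k(1), ..., e_k(N).
    Assumes e_k(0) = 0 and an integral sum running from m = 1 to t + 1.
    """
    N = len(u)
    e_full = np.concatenate(([0.0], e))   # e_full[t] = e_k(t) for t = 0, ..., N
    running = np.cumsum(e_full)           # running[t] = e_k(1) + ... + e_k(t)
    u_next = np.empty(N)
    for t in range(N):
        u_next[t] = (u[t]
                     + kP * e_full[t + 1]
                     + kI * running[t + 1]
                     + kD * (e_full[t + 1] - e_full[t]))
    return u_next

# Small hand-checkable example with arbitrary gains.
u_next = pid_ilc_update(np.zeros(3), np.array([1.0, 2.0, 3.0]), kP=1.0, kI=0.1, kD=0.5)
print(u_next)
```

For the first sample the three terms are 1.0, 0.1 and 0.5, so the updated input there is 1.6; the remaining entries
follow in the same way.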

The gains $k_P$, $k_I$, and $k_D$ are to be determined so as to satisfy

$$\lim_{k \to \infty} \left( y_d(t) - y_k(t) \right) = 0 \quad \text{for } t = 1, 2, \ldots, N. \tag{17}$$

The limit in (17) indicates that perfect tracking is ultimately achieved as the number of iterations
tends to infinity.

The system (1) can be easily transformed to the following relation

$$Y(k) = G\, U(k) + G_0 x_0, \qquad k = 0, 1, 2, \ldots \tag{18}$$

where U(k) and Y(k) are the input and output vectors, respectively, in iteration k,

$$U(k) = [u_k(0) \;\; u_k(1) \;\; \cdots \;\; u_k(N-1)]^T, \qquad Y(k) = [y_k(1) \;\; y_k(2) \;\; \cdots \;\; y_k(N)]^T \tag{19}$$

$G_0$ is the matrix $G_0 = [CA \;\; CA^2 \;\; \cdots \;\; CA^N]^T$ and G is the same matrix as shown in (4).

The system relation shown in (18) can be rewritten in the form

$$Y(k+1) = Y(k) + G\, V(k), \qquad k = 0, 1, \ldots \tag{20}$$

where

$$V(k) = U(k+1) - U(k) \tag{21}$$

The error dynamics of the system can be obtained from (20) as

$$E(k+1) = E(k) - G\, V(k), \qquad k = 0, 1, \ldots \tag{22}$$

where E(k) is the error vector at iteration k, defined as

$$E(k) = Y_d - Y(k) = [e_k(1) \;\; e_k(2) \;\; \cdots \;\; e_k(N)]^T, \qquad Y_d = [y_d(1) \;\; y_d(2) \;\; \cdots \;\; y_d(N)]^T \tag{23}$$

From the definitions of V(k) and E(k), the control law (15) can be written in the following compact form

$$V(k) = \{ (k_P + k_D) I + k_I F_1 - k_D F_2 \} E(k) \quad \text{for } k = 0, 1, \ldots \tag{24}$$


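where I is the N × N identity matrix and $F_1$ and $F_2$ are defined as

$$
F_1 = \begin{bmatrix}
1 & 0 & \cdots & 0 \\
1 & 1 & \cdots & 0 \\
\vdots & \vdots & \ddots & \vdots \\
1 & 1 & \cdots & 1
\end{bmatrix},
\qquad
F_2 = \begin{bmatrix}
0 & 0 & \cdots & 0 & 0 \\
1 & 0 & \cdots & 0 & 0 \\
0 & 1 & \cdots & 0 & 0 \\
\vdots & & \ddots & & \vdots \\
0 & 0 & \cdots & 1 & 0
\end{bmatrix}
\tag{25}
$$

Substituting V(k) from (24) into the error dynamics equation (22) yields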

$$E(k+1) = G_c E(k), \qquad k = 0, 1, \ldots \tag{26}$$

where $G_c$ is the matrix given by

$$G_c = I - (k_P + k_D) G - k_I G F_1 + k_D G F_2 \tag{27}$$

Achieving the tracking accuracy (17) depends on the matrix $G_c$, so the gains $k_P$, $k_I$, and $k_D$ must be selected
appropriately.

This algorithm guarantees monotonic convergence provided that a sufficient condition, expressed in terms of the system
Markov parameters, is satisfied and appropriate gains of the optimal PID approach are selected.
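
The lifted form of the PID type law can be checked numerically with a short sketch. Here F_1 is taken as the
lower-triangular matrix of ones (running sum) and F_2 as the one-sample delay matrix, which assumes e_k(0) = 0; the
plant G and the gains are hypothetical and are not the values used in the simulation study.

```python
import numpy as np

def pid_learning_matrices(N):
    """F1 forms the running sum e(1) + ... + e(i); F2 delays the error by one sample."""
    F1 = np.tril(np.ones((N, N)))
    F2 = np.eye(N, k=-1)
    return F1, F2

def error_transition_matrix(G, kP, kI, kD):
    """Matrix Gc such that E(k+1) = Gc E(k) under the PID type update."""
    N = G.shape[0]
    F1, F2 = pid_learning_matrices(N)
    return np.eye(N) - (kP + kD) * G - kI * (G @ F1) + kD * (G @ F2)

# Hypothetical lifted plant (impulse response 0.5 * 0.8**i) and arbitrary gains.
N = 15
h = 0.5 * 0.8 ** np.arange(N)
G = np.array([[h[i - j] if i >= j else 0.0 for j in range(N)] for i in range(N)])
kP, kI, kD = 1.0, 0.1, 0.5

F1, F2 = pid_learning_matrices(N)
Gc = error_transition_matrix(G, kP, kI, kD)

E = np.cos(0.3 * np.arange(1, N + 1))                # some error vector e(1..N)
V = ((kP + kD) * np.eye(N) + kI * F1 - kD * F2) @ E  # lifted input increment
V_time = np.array([kP * E[i] + kI * E[:i + 1].sum()
                   + kD * (E[i] - (E[i - 1] if i > 0 else 0.0))
                   for i in range(N)])               # same increment, sample by sample

E_next_direct = E - G @ V
E_next_lifted = Gc @ E
diag_Gc = np.diag(Gc)                 # eigenvalues of Gc, since Gc is lower triangular
norm_Gc = np.linalg.norm(Gc, 2)       # induced 2-norm
print(diag_Gc[0], norm_Gc)
```

Since G_c is lower triangular, its eigenvalues are its diagonal entries. An induced 2-norm below one is a standard
sufficient condition for a monotonic decrease of the error norm; whether it coincides with the condition derived in
Madady (2008) is not checked here.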

4. SIMULATION RESULTS 

To assess the effectiveness of these two algorithms in reducing the tracking error and to compare their accuracy in
reaching the target, a simulation study is conducted. Consider a one-link, one-degree-of-freedom arm with a revolute
joint, shown in Figure 1.



This system is described in state-space form as follows (see Saha (2008))

$$
\begin{aligned}
\dot{y}_1 &= y_2, \\
\dot{y}_2 &= \frac{3}{m a^2} \left( \tau - \frac{1}{2} m g a \sin\theta \right)
\end{aligned}
\tag{28}
$$


where the two-dimensional state vector is given by $y = [y_1 \;\; y_2]^T = [\theta \;\; d\theta/dt]^T$. Also, a is the
length of the link, m is the mass of the link, $\theta$ is the angle of the link, and $\tau$ is the torque input to the
system. The task is to determine the input torque $\tau$ applied to the link so that the link angle $\theta$ approaches
the desired angle $\theta_d$ over the time interval [0, T]. As the number of iterations increases, the error between
$\theta$ and $\theta_d$ is gradually reduced and eventually becomes zero.
The physical parameters and the desired trajectory are listed as follows.

Physical parameters: m = 0.5 kg, a = 0.3 m, g = 9.81 m/s^2

Desired trajectory: $\theta_d = 5 \sin 0.5t$ for $t \in [0, 14]$, with time interval T = 14
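
For readers who wish to reproduce the setting, the arm model and the desired trajectory can be sketched as below. The
sampling interval is not stated in the text, so h = 0.01 is an assumption, as are the fourth-order Runge-Kutta
integrator, the zero-order hold on the torque and all function names.

```python
import numpy as np

M_LINK, A_LINK, GRAV = 0.5, 0.3, 9.81      # m [kg], a [m], g [m/s^2] from the text

def arm_rhs(y, tau):
    """Right-hand side of the one-link arm model: y = [theta, dtheta/dt]."""
    theta, omega = y
    accel = 3.0 / (M_LINK * A_LINK ** 2) * (tau - 0.5 * M_LINK * GRAV * A_LINK * np.sin(theta))
    return np.array([omega, accel])

def simulate_arm(tau_seq, h, y0=(0.0, 0.0)):
    """Integrate the arm with RK4, holding each torque sample constant over one step.

    Returns theta at the sample instants 0, h, 2h, ..., len(tau_seq) * h.
    """
    y = np.array(y0, dtype=float)
    thetas = [y[0]]
    for tau in tau_seq:
        k1 = arm_rhs(y, tau)
        k2 = arm_rhs(y + 0.5 * h * k1, tau)
        k3 = arm_rhs(y + 0.5 * h * k2, tau)
        k4 = arm_rhs(y + h * k3, tau)
        y = y + h / 6.0 * (k1 + 2.0 * k2 + 2.0 * k3 + k4)
        thetas.append(y[0])
    return np.array(thetas)

# Desired trajectory on [0, 14] with an assumed step of 0.01.
N_STEPS = 1400
t = np.linspace(0.0, 14.0, N_STEPS + 1)
h = 14.0 / N_STEPS
theta_d = 5.0 * np.sin(0.5 * t)

# Sanity check: the torque that balances gravity at theta0 keeps the arm at rest.
theta0 = 0.3
tau_hold = 0.5 * M_LINK * GRAV * A_LINK * np.sin(theta0)
theta_hold = simulate_arm(np.full(100, tau_hold), h, y0=(theta0, 0.0))
print(theta_hold[-1])
```

The holding-torque check uses the equilibrium of the model, where the applied torque equals the gravity torque
(1/2) m g a sin(theta).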



Figure 2: The Output Trajectory of Control Law (7) at the Iteration k = 1, 4 and 10 

The control law (7) is applied with M = 10 and the desired trajectory over the time interval. A small value of
$W = 10^{-6} I$ is selected to provide maximal convergence rates whilst avoiding any numerical problems in evaluating
the parameter vector. Simulation results are shown in Figures 2 and 3.



Figure 3: The Tracking Error Versus the Number of Iterations for the Link of Control Law (7)

The results indicate that as the iteration number increases, the output converges to the given desired trajectory.
The convergence is clearly monotonic, as the tracking error decreases gradually as the iteration number
increases.

Next, the control law (15) is applied with the optimal selection of the gains $k_P$, $k_I$, and $k_D$ according to the
step-by-step procedure given in Madady (2008). The optimal coefficients $k_P$, $k_I$, and $k_D$ are chosen as
-0.0877, 1.358 × 10^{-4}, and 43.291, respectively. The same desired trajectory is selected with the same time
interval. This leads to the results given in Figures 4 and 5.



Figure 4: The Output Trajectory of Control Law (15) at the Iteration k = 1, 4 and 10 



Figure 5: The Tracking Error Versus the Number of Iterations for the Link of Control Law (15) 

The results clearly show that the output tracks the desired trajectory as the number of iterations increases.
The convergence in this simulation is also monotonic.

To see how precisely the POILC algorithm using polynomial representations of the inverse plant and the PID type
algorithm track the desired trajectory, the convergence of both algorithms is compared in Figure 6.

Figure 6: The Convergence Comparison between Two Algorithms

Figure 6 shows that both algorithms have the same trend of convergence, as both are monotonic.
However, the PID type ILC algorithm improves the convergence more than the POILC algorithm using
polynomial representations of the inverse plant. With the PID type algorithm, the angle of the link tracks the
desired position trajectory more quickly and the convergence rate is much higher.

5. CONCLUSIONS 

In this paper, two iterative learning control algorithms, POILC using polynomial representations of the inverse plant
and PID type ILC, have been explained and applied to the position tracking problem of a one-link robot manipulator.
Both algorithms are based on parameter optimization, in which suitable parameters are selected during each
repetition in order to improve performance. Simulation results from applying these control laws to the robot
manipulator are presented and show that control over performance aspects is possible. The results of both algorithms
show the same behavior: the position tracking errors decrease monotonically as the iteration number increases.
However, the PID type ILC gives better performance, as it tracks the desired trajectory more accurately and
faster for the same number of iterations. Extending these algorithms to the control of a two-link robot manipulator
is an interesting topic for future research.